Ensemble Learning for Hidden Markov Models

نویسنده

  • David J C Mackay
چکیده

The standard method for training Hidden Markov Models optimizes a point estimate of the model parameters. This estimate, which can be viewed as the maximum of a posterior probability density over the model parameters, may be susceptible to over-tting, and contains no indication of parameter uncertainty. Also, this maximummay be unrepresentative of the posterior probability distribution. In this paper we study a method in which we optimize an ensemble which approximates the entire posterior probability distribution. The ensemble learning algorithm requires the same resources as the traditional Baum{Welch algorithm. The traditional training algorithm for hidden Markov models is an expectation{ maximization (EM) algorithm (Dempster et al. 1977) known as the Baum{Welch algorithm. It is a maximum likelihood method, or, with a simple modiication, a penalized maximum likelihood method, which can be viewed as maximizing a posterior probability density over the model parameters. Recently, Hinton and van Camp (1993) developed a technique known as ensemble learning (see also MacKay (1995) for a review). Whereas maximum a poste-riori methods optimize a point estimate of the parameters, in ensemble learning an ensemble is optimized, so that it approximates the entire posterior probability distribution over the parameters. The objective function that is optimized is a vari-ational free energy (Feynman 1972) which measures the relative entropy between the approximating ensemble and the true distribution. In this paper we derive and test an ensemble learning algorithm for hidden Markov models, building on Neal

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Improved Ensemble Training for Hidden Markov Models using Random Relative Node Permutations

Hidden Markov Models have many applications in signal processing and pattern recognition, but their convergencebased training algorithms are known to suffer from oversensitivity to the initial random model choice. This paper focuses upon the use of model averaging, ensemble thresholding, and random relative model permutations for improving average model performance. A method is described which ...

متن کامل

Introducing Busy Customer Portfolio Using Hidden Markov Model

Due to the effective role of Markov models in customer relationship management (CRM), there is a lack of comprehensive literature review which contains all related literatures. In this paper the focus is on academic databases to find all the articles that had been published in 2011 and earlier. One hundred articles were identified and reviewed to find direct relevance for applying Markov models...

متن کامل

Boosting Input/Output Hidden Markov Models for Sequence Classification

Input/output hidden Markov model (IOHMM) has turned out to be effective in sequential data processing via supervised learning. However, there are several difficulties, e.g. model selection, unexpected local optima and high computational complexity, which hinder an IOHMM from yielding the satisfactory performance in sequence classification. Unlike previous efforts, this paper presents an ensembl...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1997